Integrating Syntactic and Semantic Tools in Sfy
نویسنده
چکیده
Despite the large lexicon of even broad coverage Natural Language Processing systems, there are many missing lexical items. We describe two methodologically distinct approaches to augment the lexicon: the look-up approach and the shotgun approach. Both approaches are framed within Sfy, a new research program to pull together several broad coverage systems. The classic look-up approach requires compatible electronic sources (e.g. WordNet). Through system hooks into the source, we can pull as much relational and semantic information as Sfy requires. Since WordNet does not overtly provide Sfy with sufficient syntactic role and semantic information, we use the synonyms of an unknown target word that WordNet provides us with to create templates for a new entry in our lexicon. In a fully realistic model, we cannot always rely on the lookup approach to solve our lexical issues. We need another back-off method. Much like a person intuiting the part-ofspeech of a new term, an unknown word presents a syntactic hole to our parser. Only certain parts of speech will fill that hole. We can try to validate all the possible fillers by naively testing every part of speech. The subset of all these sentences that can be parsed informs us of exactly which parts of speech our unknown word can be. Interestingly, the shotgun approach also provides a means to solve the related problem of a familiar orthogrpahic word functioning in an unfamiliar part of speech.
منابع مشابه
Verbs in Applied Linguistics Research Article Introductions: Semantic and syntactic analysis
This study aims to investigate the semantic and syntactic features of verbs used in the introduction section of Applied Linguistics research articles published in Iranian and international journals. A corpus of 20 research article introductions (10 from each journal) was used. The corpus was analysed for the syntactic features (tense, aspect and voice) and semantic meaning of verbs. The finding...
متن کاملVerbs in Applied Linguistics Research Article Introductions: Semantic and syntactic analysis
This study aims to investigate the semantic and syntactic features of verbs used in the introduction section of Applied Linguistics research articles published in Iranian and international journals. A corpus of 20 research article introductions (10 from each journal) was used. The corpus was analysed for the syntactic features (tense, aspect and voice) and semantic meaning of verbs. The finding...
متن کاملبرچسبزنی نقش معنایی جملات فارسی با رویکرد یادگیری مبتنی بر حافظه
Abstract Extracting semantic roles is one of the major steps in representing text meaning. It refers to finding the semantic relations between a predicate and syntactic constituents in a sentence. In this paper we present a semantic role labeling system for Persian, using memory-based learning model and standard features. Our proposed system implements a two-phase architecture to first identify...
متن کاملبرچسبزنی خودکار نقشهای معنایی در جملات فارسی به کمک درختهای وابستگی
Automatic identification of words with semantic roles (such as Agent, Patient, Source, etc.) in sentences and attaching correct semantic roles to them, may lead to improvement in many natural language processing tasks including information extraction, question answering, text summarization and machine translation. Semantic role labeling systems usually take advantage of syntactic parsing and th...
متن کاملManipulation in advertising text: lexical and semantic aspect
The present paper focuses on the questions of modern advertising science, structure of advertising and elements making actual manipulative influence from the addresser. Advertising encourages product sales, is an instrument of forming ethical standards, values, creating cultural values, standards and mode of behavior that is why the wide system of means for achieving aims of advertisers is need...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008